AITopics | winograd convolution

Collaborating Authors

winograd convolution

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

400a2e6a82520b690810b97fd67fcc4e-Paper-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 21:56:20 GMT

convolution, quantization, winograd convolution, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
North America > United States > Nevada > Clark County > Las Vegas (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(10 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Vision (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Add feedback

A Related Works 318 Private inference has been a promising solution to protect both data and model privacy during deep

Neural Information Processing SystemsNov-20-2025, 01:28:21 GMT

Private inference framework CoPriv adopts CypTFlow2 [39] protocol for private inference. MBps bandwidth and 80ms echo latency. We train our proposed CoPriv with self-distillation. For the network re-parameterization mentioned in Section 4.2, here we provide the For some input size, the input cannot be covered by tiles. The correctness and equivalence can be proved with Eq. 1. Also, [ Winograd convolution for stride of 2. The algorithm can be nested with itself to obtain a 2D algorithm The correctness analysis is the same with Section D.3.

artificial intelligence, inference, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.04)

Genre: Research Report > Promising Solution (0.40)

Industry: Information Technology > Security & Privacy (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.51)

Add feedback

400a2e6a82520b690810b97fd67fcc4e-Paper-Conference.pdf

Neural Information Processing SystemsOct-10-2025, 23:12:25 GMT

artificial intelligence, machine learning, quantization, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
North America > United States > Nevada > Clark County > Las Vegas (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(10 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Vision (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Add feedback

Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales

Pan, Shuokai, Tuzi, Gerti, Sreeram, Sudarshan, Gope, Dibakar

arXiv.org Artificial IntelligenceDec-27-2024

Despite the revolutionary breakthroughs of large-scale textto-image diffusion models for complex vision and downstream tasks, their extremely high computational and storage costs limit their usability. Quantization of diffusion models has been explored in recent works to reduce compute costs and memory bandwidth usage. To further improve inference time, fast convolution algorithms such as Winograd can be used for convolution layers, which account for a significant portion of computations in diffusion models. However, the significant quality loss of fully quantized Winograd using existing coarser-grained post-training quantization methods, combined with the complexity and cost of finetuning the Winograd transformation matrices for such large models to recover quality, makes them unsuitable for large-scale foundation models. Motivated by the presence of a large range of values in them, we investigate the impact of finer-grained group-wise quantization in quantizing diffusion models. While group-wise quantization can largely handle the fully quantized Winograd convolution, it struggles to deal with the large distribution imbalance in a sizable portion of the Winograd domain computation. To reduce range differences in the Winograd domain, we propose finetuning only the scale parameters of the Winograd transform matrices without using any domain-specific training data. Because our method does not depend on any training data, the generalization performance of quantized diffusion models is safely guaranteed. For text-to-image generation task, the 8-bit fully-quantized diffusion model with Winograd provides near-lossless quality (FID and CLIP scores) in comparison to the full-precision model. For image classification, our method outperforms the state-of-the-art Winograd PTQ method by 1.62% and 2.56% in top-1 ImageNet accuracy on ResNet18 and ResNet-34, respectively, with Winograd F(6, 3).

artificial intelligence, machine learning, quantization, (16 more...)

arXiv.org Artificial Intelligence

2412.19867

Country:

Europe > Italy > Lombardy > Milan (0.04)
Europe > France (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Exploring Winograd Convolution for Cost-effective Neural Network Fault Tolerance

Xue, Xinghua, Liu, Cheng, Liu, Bo, Huang, Haitong, Wang, Ying, Luo, Tao, Zhang, Lei, Li, Huawei, Li, Xiaowei

arXiv.org Artificial IntelligenceAug-16-2023

Winograd is generally utilized to optimize convolution performance and computational efficiency because of the reduced multiplication operations, but the reliability issues brought by winograd are usually overlooked. In this work, we observe the great potential of winograd convolution in improving neural network (NN) fault tolerance. Based on the observation, we evaluate winograd convolution fault tolerance comprehensively from different granularities ranging from models, layers, and operation types for the first time. Then, we explore the use of inherent fault tolerance of winograd convolution for cost-effective NN protection against soft errors. Specifically, we mainly investigate how winograd convolution can be effectively incorporated with classical fault-tolerant design approaches including triple modular redundancy (TMR), fault-aware retraining, and constrained activation functions. According to our experiments, winograd convolution can reduce the fault-tolerant design overhead by 55.77\% on average without any accuracy loss compared to standard convolution, and further reduce the computing overhead by 17.24\% when the inherent fault tolerance of winograd convolution is considered. When it is applied on fault-tolerant neural networks enhanced with fault-aware retraining and constrained activation functions, the resulting model accuracy generally shows significant improvement in presence of various faults.

artificial intelligence, convolution, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2308.0823

Country:

Europe (0.04)
Asia > China > Beijing > Beijing (0.04)
Asia > Singapore (0.04)

Genre: Research Report (0.64)

Industry: Information Technology (0.68)

Technology:

Information Technology > Architecture (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Going Further With Winograd Convolutions: Tap-Wise Quantization for Efficient Inference on 4x4 Tile

Andri, Renzo, Bussolino, Beatrice, Cipolletta, Antonio, Cavigelli, Lukas, Wang, Zhe

arXiv.org Artificial IntelligenceSep-26-2022

Most of today's computer vision pipelines are built around deep neural networks, where convolution operations require most of the generally high compute effort. The Winograd convolution algorithm computes convolutions with fewer MACs compared to the standard algorithm, reducing the operation count by a factor of 2.25x for 3x3 convolutions when using the version with 2x2-sized tiles $F_2$. Even though the gain is significant, the Winograd algorithm with larger tile sizes, i.e., $F_4$, offers even more potential in improving throughput and energy efficiency, as it reduces the required MACs by 4x. Unfortunately, the Winograd algorithm with larger tile sizes introduces numerical issues that prevent its use on integer domain-specific accelerators and higher computational overhead to transform input and output data between spatial and Winograd domains. To unlock the full potential of Winograd $F_4$, we propose a novel tap-wise quantization method that overcomes the numerical issues of using larger tiles, enabling integer-only inference. Moreover, we present custom hardware units that process the Winograd transformations in a power- and area-efficient way, and we show how to integrate such custom modules in an industrial-grade, programmable DSA. An extensive experimental evaluation on a large set of state-of-the-art computer vision benchmarks reveals that the tap-wise quantization algorithm makes the quantized Winograd $F_4$ network almost as accurate as the FP32 baseline. The Winograd-enhanced DSA achieves up to 1.85x gain in energy efficiency and up to 1.83x end-to-end speed-up for state-of-the-art segmentation and detection networks.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2209.12982

Country:

Europe > Switzerland > Zürich > Zürich (0.04)
North America > United States > Washington > King County > Renton (0.04)
North America > United States > District of Columbia > Washington (0.04)
(2 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

Fast Convolution based on Winograd Minimum Filtering: Introduction and Development

Tong, Gan, Huang, Libo

arXiv.org Artificial IntelligenceNov-1-2021

Convolutional Neural Network (CNN) has been widely used in various fields and played an important role. Convolution operators are the fundamental component of convolutional neural networks, and it is also the most time-consuming part of network training and inference. In recent years, researchers have proposed several fast convolution algorithms including FFT and Winograd. Among them, Winograd convolution significantly reduces the multiplication operations in convolution, and it also takes up less memory space than FFT convolution. Therefore, Winograd convolution has quickly become the first choice for fast convolution implementation within a few years. At present, there is no systematic summary of the convolution algorithm. This article aims to fill this gap and provide detailed references for follow-up researchers. This article summarizes the development of Winograd convolution from the three aspects of algorithm expansion, algorithm optimization, implementation, and application, and finally makes a simple outlook on the possible future directions.

convolution, convolutional neural network, winograd convolution, (10 more...)

arXiv.org Artificial Intelligence

doi: 10.5121/csit.2021.111716

2111.00977

Country:

North America > United States > California > San Francisco County > San Francisco (0.28)
Europe > Austria > Vienna (0.14)
Asia > Macao (0.14)
(31 more...)

Genre: Research Report (0.51)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Efficient Residue Number System Based Winograd Convolution

Liu, Zhi-Gang, Mattina, Matthew

arXiv.org Machine LearningJul-23-2020

Prior research has shown that Winograd algorithm can reduce the computational complexity of convolutional neural networks (CNN) with weights and activations represented in floating point. However it is difficult to apply the scheme to the inference of low-precision quantized (e.g. INT8) networks. Our work extends the Winograd algorithm to Residue Number System (RNS). The minimal complexity convolution is computed precisely over large transformation tile (e.g. 10 x 10 to 16 x 16) of filters and activation patches using the Winograd transformation and low cost (e.g. 8-bit) arithmetic without degrading the prediction accuracy of the networks during inference. The arithmetic complexity reduction is up to 7.03x while the performance improvement is up to 2.30x to 4.69x for 3 x 3 and 5 x 5 filters respectively.

artificial intelligence, machine learning, residue number system, (18 more...)

arXiv.org Machine Learning

2007.12216

Country:

North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > Canada > Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.04)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

Searching for Winograd-aware Quantized Networks

Fernandez-Marques, Javier, Whatmough, Paul N., Mundy, Andrew, Mattina, Matthew

arXiv.org Machine LearningFeb-25-2020

Lightweight architectural designs of Convolutional Neural Networks (CNNs) together with quantization have paved the way for the deployment of demanding computer vision applications on mobile devices. Parallel to this, alternative formulations to the convolution operation such as FFT, Strassen and Winograd, have been adapted for use in CNNs offering further speedups. Winograd convolutions are the fastest known algorithm for spatially small convolutions, but exploiting their full potential comes with the burden of numerical error, rendering them unusable in quantized contexts. In this work we propose a Winograd-aware formulation of convolution layers which exposes the numerical inaccuracies introduced by the Winograd transformations to the learning of the model parameters, enabling the design of competitive quantized models without impacting model size. We also address the source of the numerical error and propose a relaxation on the form of the transformation matrices, resulting in up to 10% higher classification accuracy on CIFAR-10. Finally, we propose wiNAS, a neural architecture search (NAS) framework that jointly optimizes a given macro-architecture for accuracy and latency leveraging Winograd-aware layers. A Winograd-aware ResNet-18 optimized with wiNAS for CIFAR-10 results in 2.66x speedup compared to im2row, one of the most widely used optimized convolution implementations, with no loss in accuracy.

accuracy, convolution, winograd convolution, (15 more...)

arXiv.org Machine Learning

2002.10711

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > Sweden > Stockholm > Stockholm (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
(3 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Filters

Collaborating Authors

winograd convolution

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

400a2e6a82520b690810b97fd67fcc4e-Paper-Conference.pdf

A Related Works 318 Private inference has been a promising solution to protect both data and model privacy during deep

400a2e6a82520b690810b97fd67fcc4e-Paper-Conference.pdf

f96839fc751b67492e17e70f5c9730e4-Supplemental-Conference.pdf

Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales

Exploring Winograd Convolution for Cost-effective Neural Network Fault Tolerance

Going Further With Winograd Convolutions: Tap-Wise Quantization for Efficient Inference on 4x4 Tile

Fast Convolution based on Winograd Minimum Filtering: Introduction and Development

Efficient Residue Number System Based Winograd Convolution

Searching for Winograd-aware Quantized Networks